Meta-learning for Indian languages: Performance analysis and improvements with linguistic similarity measures

نویسندگان

چکیده

Indian languages share a lot of overlap in acoustic and linguistic content. Though different use writing systems, the phoneme sets logically overlap. Most these are low-resourced, lacking enough annotated speech data to build good automatic recognition (ASR) systems. Recently proposed model-agnostic meta-learning (MAML) algorithm has shown great success fast adaptation multilingual models unseen datasets. In this work, we establish usefulness MAML pretraining quickly building reasonably ASRs for low-resource languages. significantly outperforms joint training its capability few-shot learning faster adaptation. On average, yields absolute improvements 5.4% CER 20.3% WER over fast-adaptation setting with five epoch fine-tuning. Further, exploit similarities source transcriptions target through loss-weighing scheme during improve performance models. Similarity-based loss-weighings yield 0.2% 1% on average.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New distance and similarity measures for hesitant fuzzy soft sets

The hesitant fuzzy soft set (HFSS), as a combination of hesitant fuzzy and soft sets, is regarded as a useful tool for dealing with the uncertainty and ambiguity of real-world problems. In HFSSs, each element is defined in terms of several parameters with arbitrary membership degrees. In addition, distance and similarity measures are considered as the important tools in different areas such as ...

متن کامل

Evaluation of Similarity Measures for Template Matching

Image matching is a critical process in various photogrammetry, computer vision and remote sensing applications such as image registration, 3D model reconstruction, change detection, image fusion, pattern recognition, autonomous navigation, and digital elevation model (DEM) generation and orientation. The primary goal of the image matching process is to establish the correspondence between two ...

متن کامل

SOME SIMILARITY MEASURES FOR PICTURE FUZZY SETS AND THEIR APPLICATIONS

In this work, we shall present some novel process to measure the similarity between picture fuzzy sets. Firstly, we adopt the concept of intuitionistic fuzzy sets, interval-valued intuitionistic fuzzy sets and picture fuzzy sets. Secondly, we develop some similarity measures between picture fuzzy sets, such as, cosine similarity measure, weighted cosine similarity measure, set-theoretic similar...

متن کامل

Word Similarity Datasets for Indian Languages: Annotation and Baseline Systems

With the advent of word representations, word similarity tasks are becoming increasing popular as an evaluation metric for the quality of the representations. In this paper, we present manually annotated monolingual word similarity datasets of six Indian languages – Urdu, Telugu, Marathi, Punjabi, Tamil and Gujarati. These languages are most spoken Indian languages worldwide after Hindi and Ben...

متن کامل

Using Meta-Languages for Learning

This paper proposes the use of metalanguages to provide an eeective representation scheme for Machine Learning. This proposal is supported by both: { a demonstration of the beneets that metalanguages can bring to Inductive Logic Programming , and { the successful and widespread use of metalanguages in other search intensive areas of Artiicial Intelligence, such as planning.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3300790